High Dimensional Similarity Search With Space Filling Curves
نویسندگان
چکیده
We present a new approach for approximate nearest neighbor queries for sets of high dimensional points under any L t-metric, t = 1; : : : ; 1. The proposed algorithm is eecient and simple to implement. The algorithm uses multiple shifted copies of the data points and stores them in up to (d + 1) B-trees where d is the dimensionality of the data, sorted according to their position along a space lling curve. This is done in a way that allows us to guarantee that a neighbor within an O(d 1+1=t) factor of the exact nearest, can be returned with at most (d + 1) log p n page accesses, where p is the branching factor of the B-trees. In practice, for real data sets, our approximate technique nds the exact nearest neighbor between 87% and 99% of the time and a point no farther than the third nearest neighbor between 98% and 100% of the time. Our solution is dynamic, allowing insertion or deletion of points in O(d log p n) page accesses and generalizes easily to nd approximate k-nearest neighbors.
منابع مشابه
On Querz Processing and Optimality Using Spectral Locality-Preserving Mappings
A locality-preserving mapping (LPM) from the multidimensional space into the one-dimensional space is beneficial for many applications (e.g., range queries, ncarestneighbor queries, clustering, and dedustcring) when multidimeIlsional data is placed into one-dimensional storage (e.g., the disk). The idea behind a locality-preserving mapping is to map points that are nearby in the multidimensiona...
متن کاملNorm-Based Locality Measures of Two-Dimensional Hilbert Curves
A discrete space-filling curve provides a 1-dimensional indexing or traversal of a multi-dimensional grid space. Applications of space-filling curves include multi-dimensional indexing methods, parallel computing, and image compression. Common goodness-measures for the applicability of space-filling curve families are locality and clustering. Locality reflects proximity preservation that close-...
متن کاملUnconventional Space-filling Curves
A 2-dimensional space-filling curve, here, is a surjective continuous map from an interval to a 2-dimensional space. To construct a new space-filling curve distinct and much different from Hilbert’s, Peano’s, and the Z-order, we first observe that all three of these curves are based on self-similar fractals–in particular, the recursive definition of the sequences of functions that converge to t...
متن کاملOn the Analysis of High Dimensional Space-Filling Curves
Space-filling curves map the multi-dimensional space into the one-dimensional space. A space-filling curve acts like a thread that passes through every cell element (or pixel) in the n-dimensional space so that every cell is visited only once, i.e., the space-filling curve does not self-intersect. Thus, a space-filling curve imposes a linear order of the cells in the n-dimensional space. This i...
متن کامل3D Hilbert Space Filling Curves in 3D City Modeling for Faster Spatial Queries
The advantages of three dimensional (3D) city models can be seen in various applications including photogrammetry, urban and regional planning, computer games, etc. They expand the visualization and analysis capabilities of Geographic Information Systems on cities, and they can be developed using web standards. However, these 3D city models consume much more storage compared to two dimensional ...
متن کامل